Microphone array sub-band speech recognition
Abstract
This paper proposes the integration of sub-band speech recognition with a microphone array. A broadband beamforming microphone array integrates naturally with sub-band speech recognition because the beamformer is typically implemented as a combination of band-limited sub-arrays. Rather than recombining the sub-array outputs into a single enhanced output, we propose fusing separate hidden Markov models, each trained on one sub-array frequency band. In addition, we propose a dynamic sub-band weighting scheme in which the cross- and auto-spectral densities of the microphone array inputs are used to estimate the reliability of each frequency band. The microphone array sub-band system is evaluated on an isolated digit recognition task and compared with the standard full-band approach. The results of the proposed dynamic weighting scheme are compared with those obtained using fixed equal sub-band weights and using optimal sub-band weights calculated from a priori knowledge of the correct results.
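The abstract does not state how the cross- and auto-spectral densities are turned into reliability estimates, so the following is only a minimal sketch under stated assumptions. It assumes two microphone channels, uses the magnitude-squared coherence averaged within each band as the reliability measure, and fuses per-band model scores as a weighted sum of log-likelihoods. The function names, band edges, and frame parameters are hypothetical and are not taken from the paper.

```python
import numpy as np

def spectral_densities(x, y, frame_len=256, hop=128):
    """Welch-style averaged auto- and cross-spectral densities of two channels."""
    window = np.hanning(frame_len)
    starts = range(0, len(x) - frame_len + 1, hop)
    X = np.array([np.fft.rfft(window * x[s:s + frame_len]) for s in starts])
    Y = np.array([np.fft.rfft(window * y[s:s + frame_len]) for s in starts])
    pxx = np.mean(np.abs(X) ** 2, axis=0)   # auto-spectral density, channel x
    pyy = np.mean(np.abs(Y) ** 2, axis=0)   # auto-spectral density, channel y
    pxy = np.mean(X * np.conj(Y), axis=0)   # cross-spectral density
    return pxx, pyy, pxy

def subband_weights(x, y, fs, band_edges_hz, frame_len=256, hop=128):
    """Assumed reliability measure: mean magnitude-squared coherence per band,
    normalised so that the weights sum to one."""
    pxx, pyy, pxy = spectral_densities(x, y, frame_len, hop)
    coherence = np.abs(pxy) ** 2 / (pxx * pyy + 1e-12)
    freqs = np.fft.rfftfreq(frame_len, d=1.0 / fs)
    reliability = np.array([
        coherence[(freqs >= lo) & (freqs < hi)].mean()
        for lo, hi in band_edges_hz
    ])
    return reliability / reliability.sum()

def fuse_scores(band_log_likelihoods, weights):
    """Assumed fusion rule: weighted sum of per-band log-likelihoods.
    band_log_likelihoods has shape (n_bands, n_classes); returns the class index."""
    return int(np.argmax(weights @ np.asarray(band_log_likelihoods)))
```

In this sketch, a band where both microphones observe the same coherent source receives a larger weight than a band dominated by uncorrelated noise. Setting all weights equal reproduces the fixed equal-weight baseline mentioned in the abstract.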
Similar resources
Multi-Channel Sub-Band Speech Recognition
Two distinct fields of research into robust speech recognition are the use of microphone arrays for signal enhancement and the use of independent frequency sub-band models for robust recognition. In this article, we propose and investigate the integration of these two techniques on two different levels. First, a broad-band beamforming microphone array allows for natural integration with sub-ban...
Adaptive Sub band GSC Beam forming using Linear Microphone-Array for Noise Reduction/Speech Enhancement
Robust continuous speech recognition system based on a microphone array
In this paper, a robust speech recognition system for videoconference applications is presented based on a microphone array. By means of a microphone array, the speech recognition system is able to determine the positions of the users and increase the signal-to-noise ratio (SNR) between the desired speaker signal and the interference from the other users. The user positions are estimated by means of...
Speech Enhancement With Microphone Spectral Subtraction in Real
It is very important to capture distant-talking speech with high quality for teleconferencing systems or voice-controlled systems. For this purpose, microphone array steering and Fourier spectral subtraction, for example, are ideal candidates. A combination technique using both microphone array steering and Fourier spectral subtraction has also been proposed to improve performance. However, it ...
Noise Reduction Using an Adaptive Microphone Array in a Car - A Speech Recognition Evaluation
This paper describes an evaluation of an adaptive microphone array with respect to speech recognition performance in a car. The microphone array is compared to two conventional microphones of different types. The speech recognition device is intended to be part of a man/machine interface between the driver and car information services.